How can we figure out what is inside thousands of spreadsheets?

نویسنده

  • Thomas Levine
چکیده

We have enough data today that we it may not be realistic to understand all of them. In hopes of vaguely understanding these data, I have been developing methods for exploring the contents of large collections of weakly structured spreadsheets. We can get some feel for the contents of these collections by assembling metadata about many spreadsheets and run otherwise typical analyses on the data-about-data; this gives us some understanding patterns in data publishing and a crude understanding of the contents. I have also developed spreadsheet-specific search tools that try to find related spreadsheets based on similarities in implicit schema. By running crude statistics across many disparate datasets, we can learn a lot about unweildy collections of poorly structured data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

یادداشت های مدیر مسئول

The heroism of the human has various manifestations. It can be drawn in a very vast range; It can be conclude the philanthropic actions of a social reformer and useful innovations of a natural sciences scientist to sacrifice the forced labor child and the very highlighted role of a spiritual leader in this field. For example, the Hollywood cinema's figures that their function in many cases is t...

متن کامل

ابهام زدایی از تفکر طراحی و شاخص های آن

Design thinking attends to designing manners to figure out common patterns to describe what designers do and how they interact with complex problems. These data can help improve designer‘s skills and other design insights. There are lacks of studies in design thinking in Iranian design studies. This issue causes defecting the understanding of design thinking principles and specifications....

متن کامل

Welcome to virosphere

Viruses may seem alien, but they are the most abundant and, arguably, the most important organisms on Earth. They are found just about everywhere, from oceans and forests to the people around you and, of course, in and on you as well. This world of strange, quasi-living things has been dubbed the virosphere, and it is a mysterious one – we know less about viruses than any other life form. But t...

متن کامل

Acceleration of Islamic Revolution's Victory Process Based on McLuhan's Theory

This article mainly focuses on how and with the aid of what media Islamic revolution's leadership spread revolution's ideology among the masses and when necessary he mobilized and organized people. To address this question McLuhan's classification of media into hot and cool has been applied to study the influence of media by Imam Khomeini and acceleration of Islamic revolution's victory. Having...

متن کامل

O1: Defining Talent: A Cultural Perspective

What is talent? How can it be identified? Who is responsible for identifying it? Are there universally valued talents, or are they all culturally bound? There are at least three different levels of analysis to explore these questions. On the government level, we must philosophically decide on how our country chooses and expresses its values through what is taught, to whom, and for what periods ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014